A Nearly- Quadratic Gap Between Adaptive and 
Non- Adaptive Property Testers 



Jeremy Hurwitz* 

Department of Computing and Mathematical Sciences, California Institute of 
Technology, Pasadena California 91125 
jliurwitz@cs . caltech.edu 

Abstract. We show that for all integers t > 8 and arbitrarily small 
e > 0, there exists a graph property 77 (which depends on e) such that 
£-testing n has non-adaptive query complexity Q = 0{q^~^^*), where 
q — 0{e~^) is the adaptive query complexity. This resolves the question 
of how beneficial adaptivity is, in the context of proximity-dependent 
properties ([9]). This also gives evidence that the canonical transforma- 
tion of Goldreich and Trevisan ([8]) is essentially optimal when convert- 
ing an adaptive property tester to a non-adaptive property tester. 
To do so, we provide optimal adaptive and non-adaptive testers for the 
combined property of having maximum degree 0{eN) and being a blow- 
up collection of an arbitrary base graph H. 
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1 Introduction 

In this paper, we consider the power of adaptive versus non-adaptive queries for 
property testers in the dense-graph model. In this model, the algorithm is given 
access to the input graph G = {[N],E) via an oracle g : [N] x [A^] {0,1} 
such that g{u, u) = 1 if and only if {u, v) £ E. A graph is said to be e-far from 
a particular property if at least eN^ edges must be added or deleted to yield a 
graph with the desired property. 

Given a graph property U, we say that a (randomized) algorithm A is an 
e-tester for 77 if Pr[^(G) = Accept] > 2/3 for all G e n and Pr[A{G) = 
Reject] > 2/3 for aU G which are e-far from 77. 

1.1 Adaptive, Non- Adaptive, and Canonical Testers 

Property testers can be broadly classified into two types according to how the 
queries are determined. In an adaptive algorithm, the results of previous queries 
can be used when determining the next query. A non-adaptive algorithm, on the 
other hand, determines all of its queries in advance. 

Most of the initial work in the dense-graph model focused on non-adaptive 
algorithms. In fact, these early algorithms \iscd an even more restricted frame- 
work, termed canonical algorithms by [8]. In a canonical algorithm, the tester 
chooses a random subset of the vertices and queries the entire induced subgraph. 

* Supported by NSF grants CCF-0830787 and CCF-0829909 



Given a property JT and an error parameter e > 0, let q be the adaptive query- 
complexity, Q be the non-adaptive query complexity, and Q be the canonical 
query complexity. By definition, q < Q < Q. A natural question, then, is to 
determine the exact relationship between these parameters. 

In [8] and [2], the authors note that any algorithm which makes q queries 
considers at most 2q vertices. It therefore suffices to query the subgraph induced 
by a random set of 2q vertices and then locally simulate the adaptive algorithm. 
This canonical transformation shows that, for any property, Q < 2q^. 

It is easy to see that there exist properties with Q = 1?((5^),^ thereby showing 
that the canonical transformation is optimal when converting (adaptive or non- 
adaptive) algorithms into canonical algorithms. Indeed, [5] and [7] show that a 
quadratic performance gap exists between non-adaptive and canonical testers 
for many natural properties. 

Similarly, we can consider the relative power of adaptive versus non-adaptive 
(but not necessarily canonical) property testers. It is widely believed that the 
canonical transformation remains optimal, in the worst case, even for this more 
modest goal. In other words, it is believed that there exist properties such that 
Q = ^2{q^)- However, proving such a separation between the adaptive and non- 
adaptive complexities has proven elusive - no unconditional gap was known until 
Goldreich and Ron demonstrated a property where Q = fi{q^/'^) in [7]. 

In light of this difficulty, researchers have considered two modifications of the 
problem. The first approach, used by [9], considers proximity- dependent proper- 
ties which depend on the tolerance parameter e. In particular, they considered 
the combined property of being bipartite and having degree at most O(eiV), and 
achieved a gap of Q = 0(g~^/^). 

The second approach, used by [7], is to consider promise problems. In this 
context, the authors showed that there exist properties such that Q = n{q~'^^^) 
for all (5 > 0, exhibiting the first nearly-quadratic gap between adaptive and 
non-adaptive queries in the dense-graph model. More specifically, the authors 
demonstrated a hierarchy of gaps of the form Q = 6'((j'~^+^/*) for each integer 
t >2. Unfortunately, the promise they use, while natural, is quite strong, and it 
is currently unclear how to remove the promise. 

In this paper, we achieve a nearly-quadratic gap without using a promise, 
at the expense of making the properties proximity-dependent. This proves that 
for all (5 > and arbitrarily small e, there exists a property 77 (which depends 
on e), such that e-testing 7T requires Q > q^~^ non-adaptive queries, where q 
is the adaptive query complexity. As in [7], we also strengthen the result and 
establish a hierarchy of relationships between the adaptive and non-adaptive 
query complexities. 

Theorem 1 (Main Theorem). For all t > 8 and arhitra/rily small c, there 
exists a graph property U (which depends on e) such that e-testing 11 has non- 
adaptive query complexity Q = 0{q^~^/^), where q = 0{e~^) is the adaptive 
query complexity. 

^ Let n consist of only the empty graph. Then Q = 0(e~^), while Q = fi{e~^). 



Theorem 1 and [7] both provide strong evidence that the canonical trans- 
formation is optimal in the general case. Although each result technically leaves 
open the possibility of a better transformation between adaptive and non-adaptive 
testers, each does so in a different, and very restricted, way. As a result, any such 
transformation would have to be very unnatural and would have to depend sen- 
sitively on the internal structure of the adaptive tester. 

1.2 Graph Blow-Ups and Blow-Up Collections 

A graph blow-up consists of replacing each vertex of a graph with a cluster of 
vertices (a rigorous definition is given in section 2). This operation is frequently 
used in studying the dense-graph model (see, for example, [1], [3], and [6]). 

The complexity of testing whether a graph is a blow-up of a fixed graph H 
was essentially resolved in [4] (see also [5]), where it is shown that, for any H, 
the adaptive query complexity is 0(e~^) and the non-adaptive query complexity 

is d{c-^). 

A graph is a blow-up collection if it can be partitioned into disjoint subgraphs, 
each of which is a blow-up of H. This notion was implicitly introduced in [7], 
which showed the following lower bound. 

Lemma 2 ([7], Lemma 5.6). Let H be a simple t-cycle, with t > 4. Test- 
ing whether G is a blow-up collection of H requires J?(e~(^~^/*'') non-adaptive 
queries, even given the promise that G has maximum degree 2teN. 

In this paper, we prove tight upper-bounds for both the adaptive and non- 
adaptive cases. Specifically, we show that a tester can determine whether G is 
a blow-up collection of any given H, given that same promise on the degrees, 
using only 0(e~^) adaptive queries or 0(e-2-i-i/(zi+2) ^ non-adaptive 
queries, where A and W are parameters depending only on H. When H is a. 
simple i-cyclc, A = 2 and W = t, and the non-adaptive upper bound reduces to 
0(e~^+^/*), matching the lower bound in Lemma 2. 

Indeed, it suffices for G to be 0(e)-close to satisfying the promise. Since [9] 
gives an cfRcient non-adaptive tester for the property of having maximum degree 
0{€N) ([9]), we obtain the following two theorems. 

Theorem 3 (Adaptive Tester). For all graphs H and constants c > 1, there 
exists an adaptive property tester (with two-sided error) for the (proximity- 
dependent) combined property of having maximum degree ceN and being a blow- 
up collection of H. The tester has query complexity 0{e~^ Ig^ 

Since any tester must make f2{e~^) queries. Theorem 3 is optimal up to a poly- 
logarithmic factor. 

Theorem 3, combined with Lemma 2, suffices to show a gap of size Q = 
f2{q~'^^^) for all ^ > 0. In the following theorem, we strengthen this result by 
proving a tight upper-bound on the non-adaptive query complexity. This shows 
that there is an infinite hierarchy of achievable relationships between the adap- 
tive and non-adaptive query complexities of proximity-dependent properties. 



Theorem 4 (Non- Adaptive Tester). For all graphs H and constants c > 
1, there exists a non-adaptive property tester (with two-sided error) for the 
(proximity- dependent) combined property of having m.axim.um degree ceN and be- 
ing a blow-up collection of H. The tester has query complexity 0(e~^+^/(^+^) + 
w/iere A = deg{H) is the maximum degree of H and W < \Hf is a 
bound on the size of a witness against H (see Definition 9). 

As mentioned previously, when H is a simple t-cycle, A = 2 and W = t. 
Therefore, combining Theorems 3 and 4 with Lemma 2 yields Theorem 1. 

We note that the algorithms in both theorems have running time polynomial 
in the query complexity. In the adaptive case, the query complexity can also be 
made polynomial in c and \H\. However, we do not do so here, opting instead to 
simplify the proof. 

1.3 Organization of The Paper 

Section 2 contains basic definitions and notation. In section 3, we show how to 
adaptively test whether a graph is a blow-up collection, given the promise that 
the graph is close to having maximum degree ceN. In section 4, we present the 
non-adaptive algorithm, under the same promise. Finally, section 5 removes the 
promise by explicitly testing that the input is close to having low degree. 

2 Notation and Basics 

All graphs are assumed to be undirected. Following standard graph-theoretic 

notation, we let r{v) = {u : {u, v) E E} denote the neighbors of v. Given 
S C V, we let G\s denote the subgraph induced by S and rs{v) = r{v) (1 S 
denote the neighbors of v in S. Given S,T cV, we let E{S,T) = {{u,v) G E : 
u € S,v € T} denote the set of edges between S and T and S A T denote the 
symmetric difference of 5* and T. 

Definition 5 (Graph Blow-Up). A graph G = {[N],E) is a blow-up of the 
graph H = ([/i], F) if there exists a partition of [N] into Vi U . . . U V/j such that 

for every i,j E [h] and {u,v) £ Vi x Vj , {u,v) G E if and only if G F. We 
denote the set of blow-ups of H by BU{H). 

Note that no requirement is made as to the relative sizes of the Vj . In particular, 
we allow the case where \Vj\ =0. 

Definition 6 (Blow-Up Collection) . A graph G = ( [N\ ,E) is a blow-up 
collection of the graph H if there exists a partition of [N] into U . . . U F*^, for 
some k, such that G\v^ e BU{H) for all i and E{V\ V^) = for all i ^ j. We 
denote the set of blow-up collections of H by BUC{H). 

Throughout this paper, H will be an arbitrary fixed graph and c > 1 will 
be an arbitrary fixed constant. We let h = \H\, A = deg{H), and CVce = {G : 

deg{G) < ceN}. 

To prove Theorems 3 and 4, we will repeatedly use the concept of a set being 
{k, a)-partitionable. Informally, a vertex is {k, a)-partitionable if almost all of its 



neighbors can be partitioned into k groups, such that all of the vertices within 
a part have essentially the same neighbors in G. The formal definition is given 
in Definition 8, after we introduce notation for such partitions. 

Definition 7. Given v e V and a > 0, letCy^a{u) = {w e r{v) : \r{w) A r{u)\ 
< aeN}. 

In other words, Cy^a{u) consists off all vertices in r(v) which have essentially 
the same neighbors as u. When v and a are clear from context, we will omit the 
subscripts and write G{u) for Cy^a{u)- 

Definition 8 ((fc, a)-Partitionable). A vertex v is (fc, Q!)-partitionable if there 
exist representatives ui,. . . ,Uk G r{v) such that Ui=i (^v,a{ui) > \r{v)\ — aeN . 

Note that if G G BUC{H), then every vertex is {deg{H), 0)-partitionable. 

Finally, we need the idea of a minimal witness against BUC{H). Informally, 
a minimal witness is a set of vertices such that the induced subgraph proves that 
G is not a valid blow-up collection, while any subset does not suffice. 

Definition 9 (Minimal Witness). A set S CV is a minimal witness against 

BUe{H) ifG\s^ BUC{H), hut for any S' CS,G\s' BUC{H). 

Note that G\s ^ BUC{H) impUes that G ^ BUC{H). Furthermore, any 
minimal witness must be connected, since we could otherwise replace the witness 
with one of its connected components. 

Given H, we let W = W{H) denote the maximum size of a minimal witness. 
It is easy to see that W < \h^. When iJ is a simple t-cycle, t>A:,W=\H\. 



3 Adaptively Testing BUC{H) Given a Promise 

Wc begin by showing how to adaptively test whether a graph G is in BUC{H), 
given the promise that G is 0(e)-close to LV^e- 

The algorithm consists of two stages. The first stage (steps 2-8) tests whether 
most vertices are (A, 0)-partitionablc, where A = deg{H). To do so, it repeatedly 
selects a vertex and then attempts to find A + 1 neighbors of that vertex which 
have mutually distinct neighborhoods (see Figure 1(a)). If it finds such a set of 
vertices, they act as a witness against being in BUC{H) and the algorithm can 
reject with certainty in step 7. 

If the algorithm fails to find such a witness, then most vertices must be 
(Z\, Q;)-partitionable, where o; is a small constant. This implies that the graph 
must be close to being a blow-up of some base graph. In other words, almost 
all of the vertices can be clustered such that each pair of clusters is either al- 
most disjoint or almost forms a complete bipartite graph. The second stage of 
the algorithm (steps 9-16) checks if this high-level structure is consistent with 
BUC{H) (see Figure 1(b)). It does so by performing a random search in G for 
W = W{H) steps, where each step selects a random neighbor of the previously 
selected vertices. If the resulting W vertices form a witness against BUC{H), the 
algorithm rejects in step 15. 



Algorithm 10. AdaptiveBlowUpCollectionTesth,c(G; e) 
1: Let A = deg{H), W = W{H), and a = {16A \Hf)-'^. 

\\ Test whether most vertices are {A,0)-partitionable. 
2: for 0(c) iterations do 

3: Select a random vertex v. 

4: Select a random set S of 0{Aa~^e~^) vertices, and query v x S. 
5: Let 5 be a random subset of Fsiv) of size (at most) cAa~^. 
6: Select a random set T of 0{A'^a~^e~^) vertices, and query S x T. 
7: If the current view of G is inconsistent with BUC{H), Reject. Specifically, 
reject if there exist ui, . . . ,ua+i € S such that /T(wj) 7^ I^T{uj) for all 

i,.h 
8: end for 

\\ If G has not yet been rejected, most vertices must be (A, a)-partitionable. 

\\ Test that G has a high-level structure consistent with H. 
9: for 0(Wc)°(^) iterations do 
10: Select vi at random, and let U = {vi}. 
11: for J = 2 to do 

12: Select a random set Tj of 0{Ae~^) vertices, and query U x Tj. 

13: If Ft (U) = 0, break. Otherwise, randomly select vj G Ft^ {U) and let 

U = UU{vj}. 
14: end for 

15: If G|c/ ^ BUC{H), Reject. 
16: end for 

17: If the algorithm hasn't yet rejected. Accept. 

Algorithm 10 has query complexity 0(e~^), as desired. Furthermore, it only 
rejects if it finds a witness against BUC{H), so the algorithm accepts valid blow- 
up collections with probability 1. The following lemma asserts that it rejects 
graphs which are far from BUC{H) with high probability. 

Lemma 11. LetG he ^-close to jCVce and e-far from BUC{H). Then Algorithm 
10 rejects with probability at least 2/3. 

The proof is the content of section 3.1. 

3.1 Proof of Lemma 11 

We first note that if G contains many vertices which are not {A, a)-partitionable, 
then G is rejected with high probability. 

Lemma 12. Let A = deg{H), W = W{H), and a = {16A\H\^)-\ Suppose 
that G is ^-close to CV^e and contains at least j^N vertices which are not 
{A, a)-partitionable. Then Algorithm 10 rejects in step 7 with probability at least 
2/3. 

The proof follows immediately from the definition of (Z\, Q;)-partitionability, 
and is given in appendix A.l. 



We now show that if G has at most j^N vertices which are not {A,a)- 
partitionable, then G can be partitioned into components such that, for each 
pair of components, either the edges between them almost form a complete 
bipartite graph or the components are almost disjoint. 

Lemma 13. Let A = deg{H) and a = (16Z\ \H\^)~^ . Suppose that G is ^-close 
to CD a and contains at most -^N vertices which are not {A,a)-partitionable. 
Then G is ^-cZose to a graph G = {V,E) for which the following holds: V can 
be partitioned into IJ - jV^U L such that 

1. f{v) = for all V 

2. E{V\V' ) = % for alii T^i'. 

3. For all i,j, f (u) = f{v) for all u,v G Vj. 
4- > jtiN for alii, j. 

where = IJj Vj and r{u) is the neighborhood of u in G. Furthermore, 



Fviiu) A Fyiiu) 



< aeN for all u&V^ 



The proof is given in appendix A. 2. 

Note that the bound on Fyi (u) A Fyi (u) in the lemma implies that G is 

|-close to having maximum degree 2ceN, since G is |-close to having maximum 
degree ceN jmd the degree of each vertex increases by at most aeN when going 
from G to G. Therefore, conditions 3 and 4 imply that most of the components 
Vj are connected to at most 32cZ\ other components and have size at most 2ceN. 

Intuitively, these conditions allow us to view each cluster as a supernode and 
each bipartite graph as an edge. From this viewpoint, G becomes a bounded- 
degree graph, with maximum degree 32cZ\. Since each supernode contains f2{eN) 
vertices (by condition 4 in the lemma), we can simulate neighbor queries by 
qucirying 0(c^^) random vertices and choosing a random neighbor. Furthermore, 
if G is i7(e)-far from BUC{H) when viewed as a dense graph, the corresponding 
bounded-degree graph is /?(l)-far from BUC{H). 

To formalize this intuition, we first show that the set of witnesses against G 
covers a constant fraction of V. 

Lemma 14. Let G be as in Lemma 13, and suppose that G is '^-far from 
BUC{H). Then there exist at least {lQWc^e)~^ distinct sets Wk = [JeVJ^f. such 
that G\^^ is a witness against BUC{H). 

The proof is given in appendix A. 3. 

We are now ready to show that the second half of Algorithm 10 finds a 
witness against G with high probability. 

By Lemma 14, step 10 of the algorithm selects a vertex vi G VJ^ correspond- 
ing to a witness with high probability. Having chosen v\, we wish to bound the 
probability that V2, ■ ■ ■ , vw are chosen so as to form a complete witness. Recall- 
ing that any minimal witness is connected, there must exist a, V2 G Vj^ which 
extends the witness in G. By condition 3 of Lemma 13, any vertex in V? can 



be used in place of V2 to extend the witness. However, since \Vj^\ > and 
Fyi {u) A Fyi (u) < aeN < | ^2 1 ' least half of V^^ is a valid choice to 
extend the witness in G. Furthermore, most of these must have degree 0{ceN). 

Iterating this procedure, we see that there are always at least n{-^N) choices 
for Vj which extend the witness in G. Furthermore, since each previous Vji was 
chosen to have degree 0{ceN), there are at most 0{WceN) potential choices for 
Vj. Therefore, with probability f2{{W'^c)^^), Algorithm 10 chooses a good vj at 
each step and finds a complete witness in G. 

The formal proof is given in Appendix A. 4. 



4 Non-adaptively Testing 3U.C{H) Given a Promise 

We now show how to non-adaptively test whether a graph is in BUC{H), given 
the promise that G is 0(e)-close to CVce- 

As in the adaptive case, the first stage of the algorithm verifies that most 
vertices arc (Z\, a)-partitionable. Assuming the graph passes the first stage, the 
second stage checks that the high-level structure of G is consistent with BUC{H). 

Since we can no longer adaptively restrict our queries to neighbors of a given 
vertex, we instead rely on the birthday paradox to achieve sub-quadratic query 
complexity. Recall that the birthday paradox says that given any distribution 
over a discrete domain D, 0{\D\^~^^'^) samples suffice to obtain a k-way collision. 

Lemma 15 (Birthday Paradox). LetV be a finite domain, and let {/Zi} be a 

set of probability distribution overV with ^i{d) = f2{l/ II^I) for all i and d € P. 
Suppose that the i-th sample is drawn according to jii. Then 0{\'D^ ^/^^ samples 
suffice to obtain a k-way collision luith high probability. 

Very informally, a collision will correspond to choosing multiple vertices from 
a single witness in such a way that a H^-way collision corresponds to a complete 
witness. Assuming that the set of witnesses has size 0(e^^) and that we can 
sample from that set with only constant overhead, the birthday paradox implies 
that 0{e~^^^^^) random vertices suffice to find all W vertices corresponding 
to a complete witness. Querying the entire induced subgraph then yields the 
desired result. The primary challenge, therefore, is to show that when G is e-far 
from BUC{H), the algorithm can efficiently sample from the space of witnesses. 

Algorithm 16. NonAdaptiveBlowUpCollectionTesth,c(G; e) 

1: Let A = deg{H), W = W{H), and a = {16A \Hf)-^. 

\\ Check that most vertices are {A,0)-partitionable. 

2: Select a set S of 0((ae)~-^+^/(^+^^) random vertices, and query 5x5. 

3: Select a set T of 0{A'^a.^^e^^) random vertices, and query 5 x T. 

4: If the current view of G is inconsistent with BUC{H), Reject. Specifically, 

reject if there exist v G S and ui, . . . ,ua+i € rs{v) such that rT{ui) ^ 

rT{uj) for all 



\\ Check that G has a high-level structure consistent with H. 



5: Select a set S of 0((Zic^e) i+i/'^) random vertices and query S x S. 

6: If G\s ^ BUC{H), Reject. 

7: If the algorithm hasn't yet rejected, Accept. 

Algorithm 16 has query complexity 0(e~2+i/(/i+2) _|_ (;iesired. 
Since it only rejects if it finds a witness against G, it accepts all graphs in 
BUC{H) with probability 1. It remains to show that it rejects graphs which are 
far from BUC{H) with high probability. 

Lemma 17. Let a = (16Zi \H\^'^)^^ , as in Algorithm 16, and let G he ^-close 
to CVce and e-far from BUC{H). Then Algorithm 16 rejects with probability at 
least 2/3. 

The proof is given in appendix B. 



5 Removing the Low-Degree Promise 

We now show how to test the combined property of being a valid blow-up col- 
lection and having low-degree. ^ 

In [9], the authors give an 0(e~^)-query algorithm for testing whether the 
input has maximum degree 0{eN). 

Lemma 18 ([9], Theorem 3). Fix c > 1 and (3 > 0. There exists a non- 
adaptive tester with query complexity 0{e~^) and two-sided error which accepts 
graphs with maximum degree ceN with probability at least 2/3 and rejects graphs 
which are fie- far from, homing maximum degree ceN with probability at least 2/3. 

To test whether G is in BUC{H) (1 LVce, we therefore run the tester from 
Lemma 18, and if it accepts, we then run either the adaptive or non-adaptive 

tester for BUC(H). 

All that remains is to show that if G is e-far from BUC{H) fl CV^t, then it 
must be Q{e)-iBX from BUC{H) or /2(e)-far from CDct- 

Lemma 19. Suppose that G is -close to CD and ^-close to BUC{H). Then 
G is e-close to CD n BUC{H). 

The proof of Lemma 19 is given in appendix C. 
We now prove Theorems 3 and 4. 

Proof (Theorems 3 and 4)- We first run the tester from Lemma 18 with (3 = 
{18cA{H)'^)~^ . If it accepts, we then run Algorithm 10 (in the adaptive case) or 
Algorithm 16 (in the non-adaptive case) with tolerance |. 

If G e CT>ce n BUC{H), then the low-degree tester accepts with probability 
at 2/3 and blow-up collection tester accepts with probability 1. So G is accepted 
with probability at least 2/3. 

Suppose that G is e-far from CT>r\BUC{H). By Lemma 19, cither G is ig§^- 
far from CD or G is |-far from BUC{H). In the first case, the low-degree tester 
rejects with probability at least 2/3. In the second case, the blow-up collection 
tester rejects with probability at least 2/3, by Lemma 11 (in the adaptive case) 
or Lemma 17 (in the non-adaptive case). □ 



6 Conclusions 



We have shown that there exist proximity-dependent graph properties for which 
a non-adaptive tester must suflFer an almost-quadratic increase in its query com- 
plexity over an adaptive tester. This shows that the canonical transformation is 
essentially optimal, in the worst case. 

The primary open question is to remove the proximity-dependence from the 
graph properties used in Theorem 1. In particular, for any (5 > 0, does there exist 
a single graph property 11 such that testing 77 requires Q = fi{q^~^) queries 
for all e > 0? It also remains to show whether there exists a nearly-quadratic 
separation when the adaptive algorithm is only allowed one-sided error. One 
approach to both of these questions is to prove an 0(e~^) upper-bound for the 
adaptive query complexity of BUC{H) for general graphs. 

Finally, we reiterate the intriguing question raised in [7] as to what rcilation- 
ships are possible between the adaptive and non-adaptive query complexities. 
Specifically, do there exists properties such that Q = 0{q^~^), with 5 ^ f ? Iii 
particular, is it true that Q must either be 0{q) or Q{q'^l^)l 

Acknowledgments: Thank you to Oded Goldreich and Chris Umans for very 
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V 



(a) A witness against v being {A, 0)-partitionable consists of two sets of 
vertices. The first set consists of Z\ + 1 vertices u\, . . . ,ua+i G r{v). The 
second set consists of ("^J^) vertices Wij such that Wij e r{ui) A r{uj) for 
each i 7^ j. When v is not (zi, a)-partitionable, there exist atN choices for 
each Ui such that Ui ^ Cv,a(uj) for any j 7^ i, so Algorithm 10 can find such 
vertices efficiently. Having chosen such a set, there exist aeN choices for Wij 
for each i ^ j, allowing algorithm 10 to complete the witness efficiently. Steps 
2-8 search for witnesses of this type. 




(b) If G has not been rejected by the first half of Algorithm 10, then V can be 
partitioned into clusters Vj , such that each pair of clusters either nearly forms 
a complete bipartite graph or is nearly disjoint. A minimal witness against 
having the correct high-level structure consists of a representative vertex from 
each set such that the induced subgraph is inconsistent with BUC{H). For 
example, when H = K4, is the complete graph on 4 vertices, the four vertices 
shown here form a minimal witness. Steps 9-16 search for witnesses of this 
type. 

Fig. 1: The two types of witnesses used by Algorithm 10 to determine whether 
G is a member of BUC{H). In each diagram, a line indicates that the algorithm 
queried that edge, with a soUd line indicating the presence of an edge and a 
dashed hne indicating the absence of an edge. 



Appendix 



A Formal Proofs for Section 3 
A.l Proof of Lemma 12 

Consider the j^N vertices which are not (/i, a)-partitionable. By assumption, 
at most -^N of them have degree greater than ZceN . For otherwise, G would 
have distance at least \ ■ 2ceN ■ = |A^^ from LV. 

Therefore, with high probability, some iteration chooses a vertex v in step 
3 that is not (A, Q;)-partitionablc and has degree at most 3ceN. Consider that 
iteration. Note that since v is not (Z\, a)-partitionable, \r{v)\ > aeN. 

We view 5 as U . . . U S^+i, with l^jj = a~^e~^. With high probability, 
Fs-iiv) 7^ 0, so let Ml be an arbitrary vertex in Fs^iv). Since v is not {A,a)- 
partitionable, r{v) contains at least aeN vertices u' such that \r{ui) A r{u')\ > 
aeN, and so with high probability / (v) contains such a vertex. Let U2 be that 
vertex. We continue in this manner, selecting vertices U3, . . . ,ua+i such that 
\r{ui) A r{uj)\ > aeN for elli^^ j. 

If |-r'5(u)| < cAa~^, then S = Fsiv). Otherwise, the probability that each 
vertex in S' is a valid choice for Ui, given ui, . . . , is at least ^, so with high 
probability S contains the desired A + 1 vertices. 

Viewing T as \JtfJlT^3, we see that, with high probability, T contains a 
vertex in r{ui) A r(uj) for all i j. The resulting view of G is therefore 
inconsistent with BUC(H) and the algorithm rejects, as desired. □ 

A. 2 Proof of Lemma 13 

First, delete all edges adjacent to any vertex which is not {A, a)-partitionable and 
add those vertices to L. The total cost of doing so is at most j^N-ceN+ = 
^Af2, since G is | close to LV^. 

The construction now proceeds in stages, with each stage constructing a 
V' = \Jj y/ such that E{V\ V\V') = 0. 

To construct = [Jj Vj- , choose an arbitrary vertex v with |-r(w)| > |A'' 
and {A, Q;)-partition r(v) into UjLi C'(wj)- For any j such that |C(mj)| > j^^N, 
add C{uj) to as a distinct VJ. Note that since \r{v)\ > ^N, there must be 
at least one partition of that size. 

While there exists a uj in such that | > |iV, choose such 

a vertex. For consistency of notation, we relabel Uj to v. Let UjLi ^i'^j) be 
a (Zi, a)-partitioning of r{v). Again, there must be a component such that 
\C{uj)\V^ \ > j^N. We add those C{uj)\V^ to as distinct F/s. 

Once every uj in has |r(uj)\y^| < |A^, we are ready to finalize V^. 
We begin by separating from V\V^ by deleting E{V^ ,V\V^), at a cost per 
vertex of at most + aeN. We next ensure that E{Vj-, Vj,) is either V^^ x Vj, 
or the empty set, for all This can be easily shown to require at most aeN 
edits per vertex. 

Note that now satisfies conditions 2-5, by construction. 

We now repeat this entire process on the remaining vertices, creating V^, 
V^, . . ., until every vertex in V has either been covered or has degree at most 



|A''. Finally, we delete all edges adjacent to the leftover vertices, at a cost of 
per vertex, and add those vertices to L. 

The total cost of this procedure is bounded by {^N + 2aeN)N < ^N'^, for a 
final cost of yA''^, as desired. □ 

A. 3 Proof of Lemma 14 

First, delete |A^^ edges so that G is in £X'2c6- Note that the resulting graph is 
still f-far from BUC{H). 

Consider some Wk- Completely disconnecting all vortices contained in Wk 
requires at most W{2ceN)^ deletions, since there are at most W components 
and, by Lemma 13, each component contains at most 2ceN vertices and each 
vortex has degree at most 2ceN. 

Therefore, if there are less than (16V[^c^e)~^ disjoint Wk, the total cost of 
deleting all witnesses in G is at most {IGWc^e)-^ ■ W{2ceN)'^ = fiV^. This 
contradicts the assumed distance to BUC{H), and the lemma follows. □ 

A. 4 Proof of Lemma 11 

If G is rejected with probability at least 2/3 by the first part of Algorithm 10, 

then we are done. So suppose otherwise. Then by Lemma 12, there exists G 
satisfying the conclusion of Lemma 13. Since G is e-far from BUC{H), G is at 
least f-far from BUC{H). 

By Lemma 14, there exist {16Wc^e)~^ sets Wk = IJ^ ^j^e corresponding to 

witnesses. Call a Wk high- degree if, for some VJ^ ^ C Wk contains ^f^N vertices 

of degree greater than 65AWc^eN . We note that at most half of the Wk arc high- 
degree. Otherwise, G would have distance at least ^ • {16Wc^e)~^ ■ GiAWc^eN ■ 
633^ = to CVce, contrary to assumption. 

We therefore restrict our attention to the (32Wc^e)~^ witnesses which are 
not high-degree. 

Let VI, . . . ,Vl, (. < W, be components corresponding to a partial witness in 

G, and let vi, . . . ,Vi be arbitrary vertices in V^, . . . , V^', respectively, such that 
< 65AWc^eN for each j < £. Let Vl_^_^ be a component adjacent to 
U^=i which extends the partial witness in G. Such a component must exist, 
since witnesses are connected. 

Since |V7+i| > jiz-^^ there are at least j§^N vertices which progress the 
witness (in G). Recall that if Vj is adjacent to VJ, in G, then every vertex in VJ 
must be adjacent in G to all but at most aeN vertices in VJ, (and similarly if 
Vj is not adjacent to Vp). Therefore, since W < ^ \Hf and a = (16Z\ 
there must be at least j^^N — WaeN > ^^^N choices for V£^i e V^^i such 
that E{vj,Vi^i) = E{vj,Vi^i) for all j < £, which means that v^^i progresses 
the witness in G as well. Of these, at least ^-f^N must also have degree at most 
65AWc^eN. 

Since |Tj| = 0{A~^€~^), with high probability rTi{U) contains such a vertex 
in step 13. Since each of the I vertices selected so far is adjacent to at most 65ceN 



vertices, there are at most £65ceN candidates for v^+i, and so the probabiUty that 
the selected w^+i progresses the witness in G is at least ^^j^ = f2{{W'^c)~^). 

Therefore, with high probability, the algorithm finds a complete witness in 
some iteration of steps 10-14 and rejects G. □ 



B Proof of Correctness for the Non- Adaptive Tester 

As in the proof for the adaptive algorithm, we first show that if G contains 
many vertices which are not {A, a)-partitionable, then G is rejected with high 
probability. 

Lemma 20. Let A = deg{H), W = W{H), and a = {16A\Hf)-^. Suppose 
that G is -^-close to £2?ce md contains at least j^N vertices which are not 
{A, a)-partitionable. Then Algorithm 16 rejects in step 4 with probability at least 
2/3. 

For ease of exposition, wo begin by proving the claim under the stronger 
assumption that G E CVct- Wc then show how to modify the argument to only 
require that G be 0(e)-closc to CVcc- 

Proof. We view the algorithm as choosing S one vertex at a time. As the sam- 
ples are chosen, we maintain an approximate partitioning of V into f2{e~^) sets 
Ua; U L, such that |t/fc| > aeN for all k and no vertex belongs to more than 
A of the Uk- For each fc, we initially say that Uk is uninitialized. The first time 
that we select v G Uk, we say that Uk is initialized with seed v, after which we 
require that Uk C r{v). 

Initially, let L consist of all {A, Q;)-partitionable vertices. Arbitrarily divide 
the remaining j^N vertices evenly between the Uk- 

Whenever u G Uk is chosen, we update the partitioning as follows. If Uk is 
uninitialized, then u must not be (A, Q;)-partitionable (by construction). We asso- 
ciate Uk with u and set Uk = r{u). Note that since u is not {A, Q:)-partitionable, 
> aeN as required. We also update the remaining uninitialized Uk' by re- 
moving any vertex v G Uk' such that v G r{u) or \r{u) A r{v)\ < ^aeN. Finally, 
we rebalance the sizes of the uninitialized Uk'- Supposing for the moment that 
G G jCVct, we remove at most |-r(ii)| • ceN/^aeN < 2c^a~^eN vertices. 

If vertices v,ui, . . . ,Ui, i < A, have been chosen from Uk, let 

e 

Uk = r{v)\ y a,a(«i)- 

Note that this ensures that Ui G r{v) and \r{ui) A r{uj)\ > aeN for all 
Furthermore, since v is not (Z\, a)-partitionable, we must have |i7/c| > aeN. 

We continue this way until either (i) A + 2 samples have been chosen from a 
single Uk or (ii) at least half of the vertices in some initialized Uk belong to at 
least A other initialized Uk'- We claim that this occurs within 0(e~^+^/('^+^)) 
steps. 



To see this, suppose that condition (ii) has not occurred. First note that 
either {8c^a~^e)~^ of the Uk have been initiahzed, or at least vertices are 
contained in the uninitiahzed C/^. Therefore, 



>i8c'a-h)-'.^N=^^N = f2{N), 



so 0(1) samples from V suffice to sample from Uk- By the birthday paradox, it 
follows that 0((f e)-i+i/('^+2)) = o(e-i+i/(^+2)) samples suffice to obtain the 
desired (A + 2)-way collision, with high probability. 

Suppose that we halted due to having chosen v,Ui,. . . , u^+i from some 11^. 
Then with high probability, T contains a vertex from r{ui) A r{uj), for each 
and the algorithm rejects in step 4. 

If, instead, we halted due to condition (ii), then there exist ^^N vertices 
which are adjacent to of the seed vertices. With high probability, T contains 
such a vertex u. Let ui, . . . , va+i be the corresponding seed vertices. Recall that, 
by construction, \r{vi) A r{vj)\ > ^aeN for all seed vertices Vi,Vj. Therefore, 
with high probability, T contains a vertex in r{vi) A r{vj) for each i ^ j, and 
the algorithm rejects in step 4. 

Note that in the preceding argument, we only assumed that G £ CDce when 
bounding the number of vertices discarded after each step from the unitialized 
Uk' - Note, however, that each additional discarded vertex implies an additional 
iaeTV distance from CVce- Therefore, we delete at most an additional -^^N 
vertices. It follows that HJj. Uk\ = ^^{N), as required, and the rest of the proof 
goes through unchanged. □ 

By Lemma 20, either G is rejected with high probability by step 4 or we can 
apply Lc!nnna 13 to obtain G, as in the adaptive case. By Lemma 14, it follows 
that there are many distinct witnesses against G, which allows us to again apply 
the birthday paradox. ^ 

Specifically, let V = {Wk} be the set of witnesses guaranteed by Lemma 14. 
As was shown in the adaptive case, for any choice of vi, - - - ,ve corresponding 
to some Wk, there exist at least 3^21^ choices for ve+i which correctly extend 
Wk in G. Therefore, 0{Wc^A) samples sufBce to obtain a vertex which extends 
some witness. Applying the birthday paradox, we see that step 5 finds a complete 
witness with high probability, in which case the algorithm rejects in step 6. 

We now formalize this argument. 

Proof (Lemma 17)- 

If G is rejected with probability at least 2/3 by the first part of Algorithm 
16, then we are done. So suppose otherwise. Then by Lemma 20, there exists G 
satisfying the concludon of Lemma 13. So by Lemma 14, there exists at least 
(SWc^e)-^ distinct Wk- 

As in the proof of Lemma 11, let V^, . . . , V^, £ < W, be components cor- 
responding to a partial witness in G, and let Vi,. . . ,ve be arbitrary vertices in 



VI,..., VI, respectively. Let V^!^^ be a component adjacent to Uj=i which 

extends the partial witness in G. Such a component must exist, since minimal 
witnesses arc connected. 

Since |V^Yi| > iga-^! there are at least vertices which progress the 

witness (in G). Also recall that if V- is adjacent to V-, in G, then every vertex in 
V- must be adjacent in G to all but at most 3^37^2 vertices in Vp (and similarly 
if Vj is not adjacent to Vp ). Therefore, since W < \h? , there must be at least 

-Wj^^N > choices for v^+i such that E{vj,ve+i) = E{vj,ve+i) 

for all j < £, which means that vg+i progresses the witness in G as well. 

We now map step 5 onto the birthday paradox. Given vertices v^,. . . ,V( in 
Vpl^^, . . . , VJ^^, respectively, let Uk C VJ^^_^^ be the set of vertices which extend 

Wk in G. 

We let V = {Uk}. Since |f7fe| > 3^2-^ ^ ^^"^ ^fc disjoint, 

0{WcPA) samples from V suffice to obtain a sample from D. Therefore, by 
Lemma 15, 0(e~^+^/^) samples sufSce to obtain a complete witness, which 
means that 5 contains a complete witness with high probability and the algo- 
rithm rejects in step 6. □ 



C Proof of Lemma 19 

Let €1 = Yg|2i2 be the distance to CDce and 62 = | be the distance to BUC(H). 
Let = IJ V^* be the optimal decomposition of G with respect to BUC{H). 

First, we completely disconnect all Vp such that | < 33^- Next, we delete 
all superfluous edges and add all missing edges so that G G BUC{H), at total 
cost at most 62 A^^. Note that since [jVp is the optimal decomposition, every 
vertex must have been connected to at least half of its neighbors. Therefore, at 
worst this doubled the degree of every vertex, thereby doubling the distance to 
£1?. Note that G is now a valid blow-up collection. 

The total cost of the edits so far is bounded by e2iV^ + ^ | V^* | zi = 
e2N'^ + ^N^, since each component is adjacent to at most A other components. 

For each if \Vp\ > ceN, delete \Vp\ — ceN vertices from Vp. This clearly 
preserves membership in BUC{H). Furthermore, the cost of doing the deletion 
is exactly equal to the decrease in distance to CDce- To see this, note that 
every neighbor of Vp needed to delete at least that many edges into Vp , and by 
symmetry we can assume that they all delete corresponding edges. 

Finally, while any high-degree vertices remain, we do the following. First, 
choose u G Vp such that |-r(M)| > ceN and v € r{u), and delete v. Note that 
the cost of doing so is, at most, ceN ■ A and the decrease in distance is at least 
\Vp\ > ^N, for a net multiphcative overhead of 3cZ\^. 

The total cost of removing the high-degree vertices is therefore bounded by 
ScA"^ ■ 2ei7V^, which means that the final cost is bounded by (£2-^^ + f-^^) + 
{ScA'^ ■ 2eiN'^) = eiV^ as desired. □ 



